Overview

Dataset Statistics

Number of Variables 18
Number of Rows 128603
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 984
Duplicate Rows (%) 0.8%
Total Size in Memory 31.4 MB
Average Row Size in Memory 256.1 B
Variable Types
  • Categorical: 3
  • Numerical: 15

Dataset Insights

TMAX is skewed Skewed
phh2o is skewed Skewed
ocd is skewed Skewed
cec is skewed Skewed
sand is skewed Skewed
silt is skewed Skewed
clay is skewed Skewed
PRCP is skewed Skewed
county_name has a high cardinality: 622 distinct values High Cardinality
TMIN has 6328 (4.92%) negatives Negatives

Variables


state_name

categorical

Approximate Distinct Count 11
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 9318007
  • The largest value (NEBRASKA) is over 1.7 times larger than the second largest value (KANSAS)

Length

Mean 7.4556
Standard Deviation 2.183
Median 8
Minimum 4
Maximum 12

Sample

1st row IOWA
2nd row IOWA
3rd row IOWA
4th row IOWA
5th row IOWA

Letter

Count 946843
Lowercase Letter 0
Space Separator 11969
Uppercase Letter 946843
Dash Punctuation 0
Decimal Number 0
  • The largest value (nebraska) is over 1.7 times larger than the second largest value (kansas)

county_name

categorical

Approximate Distinct Count 622
Approximate Unique (%) 0.5%
Missing 0
Missing (%) 0.0%
Memory Size 9212985

Length

Mean 6.639
Standard Deviation 1.8297
Median 6
Minimum 3
Maximum 15

Sample

1st row BUENA VISTA
2nd row BUENA VISTA
3rd row BUENA VISTA
4th row BUENA VISTA
5th row BUENA VISTA

Letter

Count 846738
Lowercase Letter 0
Space Separator 6913
Uppercase Letter 846738
Dash Punctuation 0
Decimal Number 0

month

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 8506838

Length

Mean 1.1481
Standard Deviation 0.3552
Median 1
Minimum 1
Maximum 2

Sample

1st row 4
2nd row 5
3rd row 6
4th row 7
5th row 8

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 147643

year

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 2012.0687
Minimum 2000
Maximum 2023
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • year is skewed left (γ1 = -0.0966)

Quantile Statistics

Minimum 2000
5-th Percentile 2001
Q1 2006
Median 2012
Q3 2017
95-th Percentile 2022
Maximum 2023
Range 23
IQR 11

Descriptive Statistics

Mean 2012.0687
Standard Deviation 6.6138
Variance 43.7421
Sum 2.5876e+08
Skewness -0.0966
Kurtosis -1.0961
Coefficient of Variation 0.003287
  • year is not normally distributed (p-value 0.0)

TMAX

numerical

Approximate Distinct Count 660
Approximate Unique (%) 0.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 82.2587
Minimum 20.5
Maximum 128.8
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • TMAX is skewed left (γ1 = -1.6757)

Quantile Statistics

Minimum 20.5
5-th Percentile 29.74
Q1 82.4
Median 89.2
Q3 94.1
95-th Percentile 102.4
Maximum 128.8
Range 108.3
IQR 11.7

Descriptive Statistics

Mean 82.2587
Standard Deviation 21.551
Variance 464.446
Sum 1.0579e+07
Skewness -1.6757
Kurtosis 1.5672
Coefficient of Variation 0.262
  • TMAX is not normally distributed (p-value 6.401274506320785e-07)
  • TMAX has 18543 outliers

TMIN

numerical

Approximate Distinct Count 787
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 33.8022
Minimum -104.4
Maximum 69.3
Zeros 65
Zeros (%) 0.1%
Negatives 6328
Negatives (%) 4.9%
  • TMIN is skewed left (γ1 = -0.7769)

Quantile Statistics

Minimum -104.4
5-th Percentile 0.11
Q1 23.6
Median 35.6
Q3 48
95-th Percentile 56.5
Maximum 69.3
Range 173.7
IQR 24.4

Descriptive Statistics

Mean 33.8022
Standard Deviation 17.4373
Variance 304.0603
Sum 4.3471e+06
Skewness -0.7769
Kurtosis 1.275
Coefficient of Variation 0.5159
  • TMIN is not normally distributed (p-value 0.0008257280286451011)
  • TMIN has 241 outliers

phh2o

numerical

Approximate Distinct Count 23
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 63.1
Minimum 48
Maximum 81
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • phh2o is skewed right (γ1 = 0.5599)

Quantile Statistics

Minimum 48
5-th Percentile 52
Q1 58
Median 60
Q3 69
95-th Percentile 75
Maximum 81
Range 33
IQR 11

Descriptive Statistics

Mean 63.1
Standard Deviation 7.2192
Variance 52.117
Sum 8.1148e+06
Skewness 0.5599
Kurtosis -0.5156
Coefficient of Variation 0.1144
  • phh2o is not normally distributed (p-value 1.4002974729984577e-11)

ocd

numerical

Approximate Distinct Count 32
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 185.4464
Minimum 92
Maximum 296
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ocd is skewed right (γ1 = 0.6119)

Quantile Statistics

Minimum 92
5-th Percentile 112
Q1 156
Median 181
Q3 216
95-th Percentile 296
Maximum 296
Range 204
IQR 60

Descriptive Statistics

Mean 185.4464
Standard Deviation 50.6856
Variance 2569.035
Sum 2.3849e+07
Skewness 0.6119
Kurtosis 0.02928
Coefficient of Variation 0.2733
  • ocd is not normally distributed (p-value 9.761923488939668e-08)

cec

numerical

Approximate Distinct Count 37
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 197.9567
Minimum 53
Maximum 332
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • cec is skewed left (γ1 = -0.3728)

Quantile Statistics

Minimum 53
5-th Percentile 81
Q1 148
Median 210
Q3 245
95-th Percentile 304
Maximum 332
Range 279
IQR 97

Descriptive Statistics

Mean 197.9567
Standard Deviation 70.6751
Variance 4994.9711
Sum 2.5458e+07
Skewness -0.3728
Kurtosis -0.7592
Coefficient of Variation 0.357
  • cec is not normally distributed (p-value 6.50792178094212e-08)

sand

numerical

Approximate Distinct Count 36
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 237.7671
Minimum 13
Maximum 852
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • sand is skewed right (γ1 = 1.2387)

Quantile Statistics

Minimum 13
5-th Percentile 13
Q1 62
Median 161
Q3 369
95-th Percentile 823
Maximum 852
Range 839
IQR 307

Descriptive Statistics

Mean 237.7671
Standard Deviation 255.2787
Variance 65167.2091
Sum 3.0578e+07
Skewness 1.2387
Kurtosis 0.1974
Coefficient of Variation 1.0737
  • sand is not normally distributed (p-value 9.626880757722324e-08)
  • sand has 1585 outliers

silt

numerical

Approximate Distinct Count 36
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 486.0907
Minimum 89
Maximum 722
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • silt is skewed left (γ1 = -0.8362)

Quantile Statistics

Minimum 89
5-th Percentile 115
Q1 416
Median 552
Q3 623
95-th Percentile 703
Maximum 722
Range 633
IQR 207

Descriptive Statistics

Mean 486.0907
Standard Deviation 177.0834
Variance 31358.5379
Sum 6.2513e+07
Skewness -0.8362
Kurtosis -0.3693
Coefficient of Variation 0.3643
  • silt is not normally distributed (p-value 1.037636869109898e-07)
  • silt has 1585 outliers

clay

numerical

Approximate Distinct Count 35
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 260.8811
Minimum 56
Maximum 407
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • clay is skewed left (γ1 = -0.6924)

Quantile Statistics

Minimum 56
5-th Percentile 58
Q1 185
Median 284
Q3 363
95-th Percentile 379
Maximum 407
Range 351
IQR 178

Descriptive Statistics

Mean 260.8811
Standard Deviation 108.4317
Variance 11757.4387
Sum 3.355e+07
Skewness -0.6924
Kurtosis -0.8234
Coefficient of Variation 0.4156
  • clay is not normally distributed (p-value 2.1226535344564763e-14)

PRCP

numerical

Approximate Distinct Count 893
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 3.4097
Minimum 0
Maximum 38.54
Zeros 767
Zeros (%) 0.6%
Negatives 0
Negatives (%) 0.0%
  • PRCP is skewed right (γ1 = 2.3214)

Quantile Statistics

Minimum 0
5-th Percentile 0.53
Q1 1.75
Median 3
Q3 4.55
95-th Percentile 7.67
Maximum 38.54
Range 38.54
IQR 2.8

Descriptive Statistics

Mean 3.4097
Standard Deviation 2.4192
Variance 5.8527
Sum 438493.53
Skewness 2.3214
Kurtosis 16.1118
Coefficient of Variation 0.7095
  • PRCP is not normally distributed (p-value 1.0832974878462485e-07)
  • PRCP has 3929 outliers

SMS_-8

numerical

Approximate Distinct Count 840
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 26.3272
Minimum 0
Maximum 101.5
Zeros 115
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • SMS_-8 is skewed left (γ1 = -0.0049)

Quantile Statistics

Minimum 0
5-th Percentile 8
Q1 17.5
Median 27.35
Q3 34.7
95-th Percentile 42.5
Maximum 101.5
Range 101.5
IQR 17.2

Descriptive Statistics

Mean 26.3272
Standard Deviation 10.9578
Variance 120.073
Sum 3.3858e+06
Skewness -0.004946
Kurtosis 0.1785
Coefficient of Variation 0.4162
  • SMS_-8 has 177 outliers

TAVG

numerical

Approximate Distinct Count 827
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 58.2246
Minimum 0.9
Maximum 91.6
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • TAVG is skewed left (γ1 = -1.2574)

Quantile Statistics

Minimum 0.9
5-th Percentile 16.81
Q1 51.7
Median 63.3
Q3 71.4
95-th Percentile 77.2
Maximum 91.6
Range 90.7
IQR 19.7

Descriptive Statistics

Mean 58.2246
Standard Deviation 18.03
Variance 325.0796
Sum 7.4879e+06
Skewness -1.2574
Kurtosis 0.8341
Coefficient of Variation 0.3097
  • TAVG has 11802 outliers

WS10M

numerical

Approximate Distinct Count 522
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 4.3657
Minimum 1.35
Maximum 8.13
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • WS10M is skewed left (γ1 = -0.0823)

Quantile Statistics

Minimum 1.35
5-th Percentile 2.84
Q1 3.72
Median 4.38
Q3 5.03
95-th Percentile 5.84
Maximum 8.13
Range 6.78
IQR 1.31

Descriptive Statistics

Mean 4.3657
Standard Deviation 0.933
Variance 0.8705
Sum 561444.56
Skewness -0.08232
Kurtosis -0.2162
Coefficient of Variation 0.2137
  • WS10M is not normally distributed (p-value 0.0007076433361301353)
  • WS10M has 585 outliers

RH2M

numerical

Approximate Distinct Count 2483
Approximate Unique (%) 1.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 68.1678
Minimum 24.86
Maximum 87.52
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • RH2M is skewed left (γ1 = -0.6877)

Quantile Statistics

Minimum 24.86
5-th Percentile 49.79
Q1 62.11
Median 69.36
Q3 75.69
95-th Percentile 81.34
Maximum 87.52
Range 62.66
IQR 13.58

Descriptive Statistics

Mean 68.1678
Standard Deviation 9.7711
Variance 95.4746
Sum 8.7666e+06
Skewness -0.6877
Kurtosis 0.1577
Coefficient of Variation 0.1433
  • RH2M is not normally distributed (p-value 4.398654939228419e-05)
  • RH2M has 1412 outliers

Value

numerical

Approximate Distinct Count 1975
Approximate Unique (%) 1.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2057648
Mean 150.3987
Minimum 0
Maximum 246.7
Zeros 183
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • Value is skewed left (γ1 = -0.7837)

Quantile Statistics

Minimum 0
5-th Percentile 68
Q1 127
Median 157.1
Q3 180.8
95-th Percentile 206.8
Maximum 246.7
Range 246.7
IQR 53.8

Descriptive Statistics

Mean 150.3987
Standard Deviation 41.6556
Variance 1735.193
Sum 1.9342e+07
Skewness -0.7837
Kurtosis 0.433
Coefficient of Variation 0.277
  • Value is not normally distributed (p-value 0.005884713893595329)
  • Value has 2943 outliers

Interactions

Correlations

Missing Values